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PRACTICAL APPLICATION OF STANDARD TESTS IN 
SPELLING, LANGUAGE, AND ARITHMETIC 



JOSEPH P. O'HERN 

Assistant Superintendent of Schools, Rochester, New York 



Rochester has done so much for its school children along the 
lines of manual training, domestic science, domestic art, music, 
drawing, physical education, etc., that the question has often been 
raised as to whether or not the subjects of reading, spelling, arith- 
metic, language, etc., were receiving the necessary time and atten- 
tion. Monthly tests and city-wide term examinations indicated 
a satisfactory condition. These tests, however, were made up 
within the system and were therefore local. The use of standard 
tests was started in order to find out how Rochester would compare 
with other cities in the so-called formal subjects. The more we 
use standard tests the more we are convinced that these tests are 
useful, not only for the purposes of diagnosis and comparison, but 
also for the purpose of locating new pupils in the right grade. 
This is especially true of the Judd Arithmetic Test, which may be 
used in grading new pupils who come from other school systems 
at all times of the year. 

SPELLING TESTS 

In spelling three tests were given: the Buckingham ten-point 
spelling scale and two Ayres speUing tests. In the Buckingham 
test the scores were high in all grades from the third to the eighth 
inclusive. 

For tne first Ayres spelling test the same words were chosen 
that were given in Springfield, Butte, Oakland, and Salt Lake City. 

The second Ayres spelling test was the same as that chosen by 
the Cleveland Survey Committee. The low score in the fourth 
grade was due to the fact that a number of words in the examination 
were not taught in this grade. In general, however, most of the 
words were found in our course of study for the grade concerned. 
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The results of these standard spelling tests justified the time limit 
we had set for speUing, which is about 2 per cent less than the time 
given to spelling in most city school systems. 

TABLE I 

Comparison of the Results from the Buckingham Spelling Scale 
IN Rochester and New York 





Grade 


Average 




III 


IV 


V 


VI 


VII 


viii 


New York 

Rochester 


30 

35 


47 
70 


60 
80 


70 

8s 


78 

91 


85 
95 


62 

77 


Difference. . . 


+ 25 


+ 23 


+20 


+ IS 


+13 


+ 10 


+ 15 



TABLE II 

Comparison of the Results from the First Ayres Spelling Test 
WITH Other Cities 







Grade 






III 


IV 


v 


VI 


VII 


VIII 


65.0 


70.0 


72.0 


68.0 


73-0 


75 -o 


81.8 


78.7 


84-5 


75-0 


76.2 


89.4 


66.7 


75-9 


84.7 


80.2 


79-9 


76.3 


89.9 


78.8 


87.6 


86.8 


87.1 


82.2 


77-8 


77-4 


85.5 


82.9 


83.1 


84.9 



Average 



Springfield, 111 

Butte, Mont 

Oakland, Cal 

Salt Lake City, Utah. 
Rochester, N.Y 



70.0 
80.3 

76.5 
86.0 
81. 9 



Note. — In Butte and Salt Lake City this test included only the B sections of the 
grade while in Oakland and Rochester it included both A and B sections. 



TABLE III 

Comparison of the Results from the Cleveland Survey Ayres Spelling Test 
WITH THE Ayres Standard 





Grade 






III 


IV 


V 


VI 


VII 


VIII 


Aver- 
age 




B 


A 


B 


A 


B 


A 


B 


A 


B 


A 


B 


A 






73.0 
73-2 


81.4 
81.9 


73.0 
68.6 


78.9 
71.4 


73.0 

71. S 


78.9 
76.8 


8o!9 


78.9 
84.6 


73.0 
79-5 


tt 


73.0 
81.6 


78.9 
84.9 






77. 5 






Difference 


+0.2 


+O.S 


-4.4 


-7-5 


-l.S 


—2.1 


+7.9 


+S.7 


+6.5 


+6.3 


+8.6 


+6.0 


+1.S 
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TABLE IV 

Comparison of the Results from the Cleveland Survey Ayres Spelling Test 
WITH THE Rochester Scores 









Grade 










Ill 


IV 


V 


VI 


VII 


VIII 




Cleveland 

Rochester 


78.0 

77-S 


73 
69.9 


75 -o 
74.0 


78.0 
82. s 


76.0 
81.9 


80.0 
833 


76.0 

77-S 


Difference . . 


-o-S 


-31 


— 1 .0 


+4-5 


+S-9 


+i-i 


+ I.S 



III 



IV 



VI 



Ayres Standard 
(average 84 cities) 

Cleveland 

Rochester 



VII VIII 

TIOO 




LANGUAGE TESTS 

The Kansas Silent-Reading Test was given in all grades of 
the city, from the third to the eighth. The scores were slightly 
above the standard in the third, fourth, and fifth, slightly below 
in the sixth, equal in the seventh, and somewhat below in the 
eighth grade. The silent-reading test showed that the schools 
having a large foreign population fell below the standard median, 
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while schools in the English-speaking sections ran well above 

the standard. 

TABLE V 

Comparison of the Results from the Kansas Silent-Reading Test 
WITH THE Standard 





Grade 




III 


IV 


V 


VI 


VII 


VIII 


Standard 


SO 
5-9 


9-4 
9.9 


134 

13-7 


13-8 
13.6 


16.5 
16. s 


19.2 

17.4 


Rochester 






Difference 


+0.9 


+O.S 


+°.3 


—0.2 





-1.8 



THORNDIKE S VISUAL-VOCABULARY TEST 

In order to test the power of word-recognition and word- 
meaning the Thorndike Visual- Vocabulary Test was given in 
eleven schools in Grades IIIB to VIII A inclusive, and in the B 
and A divisions of the third grade in eight additional schools, all 
largely foreign except one. 



Comparison of the 


TABLE VI 

Results from the Thorndike Visual- Vocabulary 
WITH Eighteen Cities of Indiana 


Test 




Grade 




III 


IV 


V 


VI 


VII 


VIII 




B A 


B 


A 


B 


A 


B 


A 


B 


A 


B 


a 




4 4 
4 


4i 

5 


Si 

26 


si 


f, 


6i 
6 


7 
66 


7§ 
7 


8+ 


St 


8 


Indiana 


6 


29 


7 


91 









In the visual-vocabulary test Thorndike attempts to test the 
understanding of the meaning of words. There are nine steps in 
this scale, five words each, the steps being equal in difficulty. There 
is increasing difficulty from line to line. The words were selected 
from a long list and tested at first by the examination of about 
fifteen hundred children. The pupils use a letter to show what the 
word represents, A for animal, F for flower, etc. This test is 
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valuable because it determines whether the child grasps the mean- 
ing of the printed word. It gives undue prominence to names of 
animals and flowers. No standard grade scores have been deter- 
mined. It is impossible, therefore, to compare the record of the 
Rochester schools with a standard score. A record, however, was 
made in eighteen cities in Indiana that may serve for the time 
being for such a comparison. On that basis the record in Rochester 
is strikingly similar to that of the cities of Indiana. 

The record in the various schools is very even and consistent 
from grade to grade, except in the foreign schools, where thirteen 
groups failed to score at all in the III B and six groups failed to score 
in III A. 

The principal in a local school, which is 90 per cent Italian, 
made a very complete report of the results attained in his 
school. The final statement in this report might be quoted as an 
explanation of the failure of so many groups to score in III B 
and III A. 

This scale was uninspiring from the standpoint of results. The element 
of nationaUty enters even to a greater extent than in the Trabue scale. Chil- 
dren who hear little or no English spoken at home and whose opportunities for 
reading outside of school are very Umited had a hard time with it. Such 
children's vocabulary of words taken apart from the context of a paragraph 
is very limited. 

Some erratic tendencies are noticed in these results. A great many indi- 
vidual children succeeded in passing certain steps after having failed in one, 
two, or three of the preceding steps. To some extent this may have been due 
to the type of words occurring in the steps. It was very evident from the 
results that words belonging to the general classes of animals or flowers were 
much more familiar than the words expressing time or the moral qualities 
classified under "Good" or "Bad." 

The fact that the IV A results were better uniformly than those from the 
VB, and the VA results better than all others, reveals a condition requiring 
further study. These results are in accord, however, with those attained by 
the corresponding grades in the Trabue test. In general the over-age children 
did poorly in this test. This was to be expected, however, since in a majority 
of cases it is inability to use English that has caused their retardation. 

At least two lines of remedial effort are suggested by these results, viz., 
(1) more emphasis on the meaning of words occurring in reading and spelling 
lessons with practice in using the same in sentences; (2) more dictionary work 
in intermediate and upper grades. 
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THORNDIKE S SCALE ALPHA 

The Thorndike Scale Alpha consists of a series of paragraphs. 
The child's understanding is tested by requiring him to answer 
questions and to follow directions concerning the paragraph. The 
scale is designed for pupils from the third grade to the high school. 
It lacks the advantage of a connected story, and the suitability of 
subject-matter might possibly be questioned as being devoid of 
interest to many pupils. 

As there was some question as to the method of scoring the 
results in this test, it was decided for the sake of uniformity that 
the total class average of correct answers to questions would be 
used as the score. 

Since no standard scores are available the results are chiefly- 
valuable for purposes of internal comparison. In the schools where 
the test was given in all grades the record for the number of divisions 
of a grade which were above or below the median is shown in 
Table VII. Schools C, D, G, and K in this table are predomi- 
nantly foreign in nationality, Schools E, H, and I are mixed in 
nationality, Schools A, B, and F are American, and School J is a 
village school recently annexed to the city. This comparison would 
seem to prove the point quoted above, that the element of nation- 
ality is a very strong factor. The schools having the larger number 
of divisions above the median might be considered as favored com- 
munities in point of environment, home influences, and oppor- 
tunities. 

TABLE VII 



School 


Above Median 


Below Median 


School 


Above Median 


Below Median 


A 


II 


I 


G 


I 


7 


B 


II 


I 


H 


8 


4 


C 


4 


8 


I 


6 


2 


D 


I 


7 


J 





12 


E 


6 


6 


K 





4 


F 


10 


2 









In comparing the results of the visual- vocabulary test with the 
results of the scale-alpha test it is evident that word-recognition is 
easier than content-reproduction. It may be that too many of our 
school children learn to pronounce words without being made fully 
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conscious of their meaning, but in the absence of standards it is a 
problem for further investigation. 

The results obtained from the use of scale alpha would seem to 
emphasize the fact that silent reading with understanding is more 
important and should receive more attention than oral reading. 

TABLE VIII 
Thorndike Scale Alpha 





Grade 




ni 


IV 


V 


VI 


VII 


VIII 




B 


A 


B 


A 


B 


A 


B 


A 


B 


A 


B 


A 


Rochester median . . 


8.3 


12. I 


13-9 


IS. 8 


17.4 


17.9s 


18.7 


18.9 


13-9 


IS. 4 


16.4 


17.6 



TRABUE COMPLETION LANGUAGE SCALE B 

The Trabue Completion Language Scale B was given in eleven 
schools from Grades III B to VIII A inclusive and in the B and A 
divisions of the third grade in the seven schools that comprise the 
Washington Junior High School District. The eleven schools 
represent all the types of schools to be found in a city the size of 
Rochester, the foreign in nationality, the mixed in nationality, and 
the well-to-do sections. 

The Trabue test is merely a list of incomplete sentences arranged 
in such a way that pupils may be expected to complete them, hence 
the name. There are ten sentences in each test. Seven minutes 
are allowed for each test. Two credits are allowed for each sen- 
tence. These tests are considered of value in judging general 
intelligence. 

Dr. Trabue found that the results of his tests agreed with the 
opinion of the teacher as to the pupils' general ability. 

Of these completion-test language scales Dr. Trabue says: 

No attempt has been made to define language in any strict sense, and it 
is entirely possible that some persons may be able to speak the English lan- 
guage and perhaps to write it fairly well without being able to make a very 
high score on these scales. It may also happen that some individuals will be 
found who score well on these language scales and are yet unable to quote a 
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single rule of English grammar. On the whole, however, it will be found that 
ability to complete these sentences successfully is very closely related to what 
is usually called language ability. 

The results of the test in the eleven schools were eminently satis- 
factory from the point of view of achievement (see Table IX). 
Only four groups out of the one hundred and forty groups examined 
were below the standard median. The gain in score from lower to 
higher grades was uniformly consistent. This would tend to con- 
firm the assumption of the author, in arranging the language scales, 
" that older children will not only be able to do the same tasks more 
rapidly and more perfectly than younger children, but that the 
older children will also be able to accomplish perfectly tasks which 
the younger children could not begin to do." 

In only seven cases did a B group of a grade within a school 
score a higher standing than the A group of the same grade. While 
there is variation in the scores within a grade, school by school, 
still the variation is not so marked as to point to lack of uniformity. 

In this test the foreign schools in the upper grades vary little 
from other schools, and in some groups attain a higher median. 

TABLE IX 

Comparison of the Results from the Trabue Completion Test with 
THE Standard 









Grade 








Ill 


IV 


V 


VI 


VII 


VIII 




7-77 
6.00 


10. 17 
8.00 


"-55 
9.80 


12.57 

11.00 


14-37 
12.30 


14-75 
13-30 


Standard 


Difference 


+ 1-77 


-1-2. 17 


+ I-7S 


+ 1-57 


+ 2.07 


+ 1-45 



STARCH GRAMMATICAL SCALE A 

The Starch Grammatical Scale A was given in the seventh and 
eighth grades in eight schools. The scale includes different items 
of grammatical forms and is therefore only a general measure of 
abihty to use correct language forms. Each pupil is given a printed 
copy of the scale and is allowed as much time as he needs. The 
scale directions are as follows: "Each of the following sentences 
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gives in parenthesis two ways in which it may be stated. Cross 
out the one you think is incorrect or bad. If you think both are 
incorrect, cross both out. If you think both are correct, underline 
both." 

The results point to a condition that is unusual, especially in 
the seventh grade, where the score was equal to the score of the 
eighth grade. In fact, the VII A score was higher than the score 
of VIII Band VIII A. 

The difference between VII A and VIII A scores would seem to 
indicate too little emphasis in VIII A and perhaps a corresponding 
overemphasis in VII A. The question of nationality or neighbor- 
hood does not seem to enter. It is a question of balance and right 
emphasis. 

TABLE X 

Comparison op the Results in the Starch Gram- 
matical Scale with the Standard 





Grade 




VII 


VIII 




B 


A 


B 


A 


Rochester 

Standard 


8.4 
8 


8.9 

n 


8.45 
8 


8.8 

■7 








Difference .... 


+ .6 


+ .3 



JUDD-COURTIS ARITHMETIC TEST 

The course of study affirms "on the one hand that the study 
of arithmetic should develop mental power along the lines of atten- 
tion, concentration, judgment, and reasoning; on the other that it 
should give the child such skill in the use of numbers as will enable 
him to perform the fundamental operations with facility and accu- 
racy and in such a manner that the mere mechanics of these pro- 
cesses shall not retard his thinking." The need of adequate ways 
and means of testing instruction in arithmetic has been felt for 
many years. The pioneers in the field of testing in the fundamental 
operations were Rice and Stone. Courtis came next with his 
Series A, which, however, proved unsatisfactory and was replaced 
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by Series B, which represented a distinct improvement over the 
earlier form. Series B has been used extensively throughout the 
country, and as a result reliable standards have been established 
in the attainments of children in the fundamental operations of 
arithmetic. It was felt by the Cleveland Survey Committee that 
"it was necessary to introduce, between the very simple type of 
example in the first series (Courtis Series A) and the highly complex 
type in the second series (Courtis Series B), tests representing types 
of intermediate complexity." As a result, and with the co-operation 
of Mr. Courtis, the Judd arithmetic tests were devised and given 
to the B grades in Cleveland and later to the B and A grades in 
Grand Rapids. The Judd test is composed of four sets in addition 
(A, E, J, M), two in subtraction (B, F), three in multiplication 
(C, G, L), four in division (D, I, K, N), and two in fractions 
(H, 0) . The pupil begins with Set A and takes each set in order. 
The test is spiral in character, and in this respect it differs from the 
Courtis tests. 

The several sets in each operation are arranged in the test in the order of 
their complexity, but with them are interwoven the sets of the other operations. 
Thus a pupil first works on a set of examples in addition, then passes succes- 
sively to sets in subtraction, multiplication, and division before encounter- 
ing addition again. This changing from one type of operation to another 
lessens the strain on the pupil which is involved in a prolonged test of this 
sort.' 

The Judd test was chosen by the Cleveland Survey Committee 
because it was thought that "it would, on the one hand, show the 
general standing of the city as a whole in the fundamentals of 
arithmetic and would, on the other hand, be diagnostic in its 
character, indicating school, class, and individual weakness in each 
of the different types of operations which enter into the solving 
of the more complex examples in each of the four fundamental 
operations." 

The Judd test was given in the same schools as the other stand- 
ard tests. In Cleveland the test was given to the B grades only and 
in Grand Rapids to the B and A grades. Rochester medians are 

' George S. Counts, Arithmetic Tests and Studies in the Psychology of Arithmetic 
(Supplementary Educational Monographs, Vol. i, No. 4, whole No. 4), p. 6. 
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below the Cleveland medians in every B grade in Sets B, H, and K, 
but above in every grade in Sets L and M. 

In B grades 27 medians were above Cleveland medians 
" B " 50 " '' below 

" A " 49 " " above 

" A " 55 " " below 

Medians improve from grade to grade in every division of the 
test. Occasionally the record of the A grade is less than the cor- 
responding B grade, but only sUghtly less in any case. 

Compared with Grand Rapids — 

In B grades 36 medians were above Grand Rapids medians 

" B " 29 » « below 

" B " 3 " « equal to 

" A " 44 " " above 

" A " 26 " " below 

" A " I median was equal to 

The comparison with Grand Rapids is more favorable to Roches- 
ter than the comparison with Cleveland. This may be due to the 
fact that there is more certainty as to the reliability of the Grand 
Rapids scores. Professor Counts says: "A great deal more care 
was taken here than in Cleveland to insure the results against error. 
In Cleveland the teachers were inexperienced in giving tests, while 
in Grand Rapids they were all more or less familiar with the 
Courtis tests." In Rochester the tests were given and timed by 
principals who had taken the courses in tests and measure- 
ments under Professor L. A. Pechstein, at the University of 
Rochester. Testing conditions were standardized and carefully 
supervised. 

A more important way of looking at the scores is from the point 
of view of difficulty. In point of difficulty from simple to complex 
Professor Counts groups the sets as follows: 

1. A, B, C, D 

2. E, F, G 

3- H 

4- I, J, K 

S. L, M, N, O 



APPLICATION OF STANDARD TESTS 673 

Compared with Cleveland in — 

4 medians above Cleveland medians 



I. A, B. C, D 1 „ . , 

' ' [20 below 

2 E F G I ^ " ^^""""^ 

^■^'^''^ lio « below 

f o " above 

3- " \ 4 " below 

/ 7 " above 

4- ^'J'^ \ 8 « below 

12 " above 



5-L.M,N,0 ^^ „ ,^^j^^ 

Compared with Grand Rapids in — 

12s medians above Grand Rapids medians 

17 " below " " " 

2 " equal " 

fi9 " above « « 

^••^''^'^ In " below " 

f o " above « 

3- ^ \ 7 " below " 

fi8 " above " 

'*•■''•'' *^ 1 9 " below " 

[18 « above " " " 

5. L, M, N, O I II " below " 

( 2 " equal " 

The foregoing comparisons would seem to indicate that in the 
more difficult sets of the test Rochester pupils have a slight advan- 
tage. Whether a larger expenditure of time on the fundamental 
operations in the lower grades would seem necessary from the results 
of this one test is a question. The deciding factor should be how 
well the work is done rather than how much time is taken. In all 
probability too much time is taken in some cases and too little in 
other cases. Some pupils and some groups need more time and 
other pupils and other groups less time. Of this the teacher must 
be the judge, save where a test clearly proves the direct necessity 
of a definite course of action. 

A comparison by grades and tests is given in Table XI 
(see p. 677). From this table the following details may be 
selected for comment: 
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SETS A, E, J, M — ADDITION 

Set A . — The medians are below Cleveland in all but IV B and 
VB. The medians are above Grand Rapids in IIIB and A, IVB 
and A, VB and A, VI B, equal in VIA, and below in VII B and A 
and VIII B and A, indicating a need of daily rapid drill in the upper 
grades in order to maintain the skill developed in the lower grades. 

Set E. — This is short, single-column addition. It is evident 
that insufficient attention is given to this type of written work. 

Set J. — This is long, single-column addition. The Rochester 
medians are above Cleveland in all grades except IVB and VB, 
and above Grand Rapids in five and below in four divisions. If 
the same attention had been given to short-column written addition 
as to long single-column addition the test would have shown the 
same satisfactory results. 

Set M. — Above Cleveland in every group and below Grand 
Rapids in three divisions, VA, VII B, and VIII B. This shows 
ample practice in written work. 

SETS B, F — SUBTRACTION 

Set B. — Many pupils added Set B instead of subtracting, which 
may explain the poor record in this set. 

Set F. — This does not mean that children cannot have examples 
under the Cleveland form. (See Course of Study, p. 14, sec. 3D.) 

SETS C, G, L — MULTIPLICATION 

Set C. — Below Cleveland in B grades and above in A grades, 
above Grand Rapids in five and below in five. Pupils misunder- 
stood this form of multiplication, as it is not used in the schools. 
This suggests that it might be well in grades above III B to express 
the work in both horizontal and vertical forms. 

Set G. — ^Above Cleveland in IVB. Other grades practically 
same as Cleveland. Above Grand Rapids in all but three 
divisions. 

Set L. — This is the most difficult set and our medians are above 
Cleveland in all grades B and A and above Grand Rapids in five 
and below in three, VII B, VIII B, and VIII A. 
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SETS D, I, K, N — ^DIVISION 

Set D.— Below Cleveland in IV B, VII B, and VIII B and above 
in V B and VI B , and above Grand Rapids in all but one. This may 
indicate that upper grades have not been using this form of division. 

Set I. — ^Above Cleveland in IV B and V B and the same in 
VI B and VII B . Above in Grand Rapids in all but one. The slight 
falling off in VIII B might indicate a need of drill in this type. 

Set K. — Below Cleveland in all grades, and below Grand Rapids 
in four and above in four. This indicates a distinct need of con- 
tinuous drill on simple long division. 

Set N. — Below Cleveland in all B grades except VI B and above 
in all A grades. Above Grand Rapids in all but one case, where 
grades are equal. This may mean the necessity of more drill on 
simple forms of long division. 

SETS H, O — FRACTIONS 

Set H. — Below Cleveland and Grand Rapids in all grades B 
and A. It is evident that too much was taken for granted in the 
work in fractions. The results indicate the need of more stress on 
the simple forms. 

Set 0. — Below Cleveland and below Grand Rapids. Here again 
the need of more daily drill on simple fractions is apparent. 

Principals and teachers should make a careful study of the 
scores, not only the scores in Rochester, but also the scores in 
Cleveland and Grand Rapids. How do we compare with Cleve- 
land and Grand Rapids as a system, grade by grade and set by set ? 
How do our own schools compare with one another? How do 
grades compare in the various operations, set by set? Is the 
progress of a grade in any school normal, or does it show variation 
— above the median in one operation and below in another, show- 
ing lack of balance, lack of stress, and lack of supervision ? Does 
the record of an individual pupil show variation to such an extent 
that remedial measures should be sought to eradicate the weakness 
or to remove wrong associations in certain phases of number work ? 
We have traced the record grade by grade and set by set and noted 
the cases where a grade is above in all sets, below in all sets, or 
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above in some and below in others. In general there is a consistent 
and stable balance, but exceptions occur here and there. The most 
noticeable variation is in Set H, fractions. In VB, where Set H 
is first given, the variation is from 0.5 lowest to 10. i highest; in 
V A from o to 7 ; in VI B from o to 9 ; in VI A from i to 6 . 8 ; in 
VII B from o to 6.5; inVIIAfroma to 8; in VIII B from o to 6.1, 
and in VIII A from 2 . 3 to 7 . 5 . In no other set is there such pro- 
nounced variation in scores. Variations are bound to appear, owing 
to individual differences, teaching conditions, environment, etc., but 
given average intelligence and normal conditions there should not 
be such pronounced variation as in the case of Set H, where 
Rochester is below Cleveland and Grand Rapids in every grade. 
In Set O we are also below Cleveland and Grand Rapids in all but 
VIII A; still there is not the same pronounced variation grade by 
grade as in the case of Set H. 

Attention is called to chapters v and vi in Counts's Arithmetic 
Tests and Studies in the Psychology of Arithmetic. Chapter v con- 
tains a "comparison of the arithmetical abilities of certain age and 
promotion groups" and chapter vi contains a "comparison of the 
arithmetical abiHties of certain race groups." A division of a num- 
ber of groups into under-age, normal, and over-age groups proved 
to our satisfaction that the conclusions reached by Professor 
Counts in chapter v are applicable to Rochester, namely, that 
"'fast' corresponds to the young and the 'slow' to the old." 
Furthermore, a glance at the arithmetic scores of the various 
schools will show that his conclusions as to the abihties of certain 
race groups would be found true in Rochester. The scores attained 
by the various grades in one school, 90 per cent ItaUan, are invari- 
ably above the median and in several grades the very highest scores 
of all. In V A this school attained the highest score in all but two 
sets. Other schools, mostly foreign, might also be mentioned in 
this connection. There is apparently no handicap from language 
or race in the study of arithmetic. This is quite natural, as num- 
ber is a universal concept apart from any medium of expression 
save the symbol itself. As Professor Counts suggests, there may 
be social factors at work tending to bring about these differences. 
Whatever may be the explanation, the fact remains that in 
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arithmetic the pupils of foreign parentage are in the lead. Finally, 
it is significant to note that the types of errors found by Counts in 
his analysis of Cleveland and Grand Rapids scores are likewise 
revealed in the Rochester results. 

SUMMARY AND CONCLUSION 

From this attempt to measure scientifically the educational 
attainment of school children in the language and arithmetical 
abilities eight points of major significance may be drawn in sum- 
mary: 

Language abilities. — 

1. The development of language ability is being adequately 
secured, so far as the entire city system is concerned. This is 
shown consistently by all the tests used. 

2. The ability to recognize words seems to develop faster than 
that of ascribing meanings and relations. The groups largely for- 
eign in make-up reveal marked weakness in the latter. 

3. The general intelligence of school children seems adequately 
measured by the Trabue Completion Test. The regularity of 
increase throughout the grades argues that the schools are success- 
fully counteracting the handicap of nationality so marked in the 
lower grades. 

4. The correct use of spoken idioms needs continued attention 
through the eight grades. These speech habits cannot be taken for 
granted. 

Arithmetical abilities. — 

1. The entire range of these special-ability tests shows that 
consistent improvement is being obtained from grade to grade. 

2. The city-wide scores argue, after comparison with comparable 
conditions in other cities, that the local work is markedly satis- 
factory. 

3. The harder arithmetical operations are executed relatively 
better than the easier ones, and certain of the easier operations 
bear comparison unfavorably. 
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4. The schools appear to take too much for granted in the 
teaching of fractions. Comparison shows that an expectation of 
better scores is warranted. 

The statistical findings of the tests and their interpretation 
suggest that considerable attention may profitably be given to the 
actual teaching situation. The principal and teacher may profit- 
ably concern themselves with the following questions: 

How does the school compare with the national and the local 
standards ? 

Which of the grades is below the standard ? 

Is this inferiority due to poor preparation, insufficient attention 
to the subject, racial handicap, poor teaching, or poor supervision ? 

What grades are above the standard ? 

Is this superiority due to superior school, and home conditions, 
superior native ability, superior teaching, or an unwarranted over- 
expenditure of time upon this particular subject ? 

Are the several duplicate grades producing comparable returns ? 

Are any of the grades showing wide group variability and 
thereby revealing a need of partial reclassification ? 

Do certain grades need to spend more time upon certain specific 
lines of work, and does the course of study need to emphasize these 
more fully? 

. The teacher may profitably be concerned with the following: 

How does my grade compare with the national and local stand- 
ards? 

How shall I interpret a possible superiority — superior native 
ability of the pupils, better environmental conditions, better teach- 
ing, excessive amount of time spent herein, or as chance ? 

How shall I interpret a possible inferiority — inferior native 
abilities or environmental handicap, too little attention to the sub- 
ject factors of chance, or poor teaching ? 

If the last named, how can I improve the condition — give more 
time to the item of demonstrated weakness, present material in a 
different manner, work more with lower fourth of group and give 
individual attention herein, and be attentive to errors of demon- 
strated frequency of occurrence and persistency ? 



